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SPECIFICATION 



SKIP LIST DATA STORAGE DURING COMPILATION 



FIELD OF THE INVENTION 

The invention relates to optimizing compilers 
and methods of compiling. More particularly, the 
invention relates to optimizing routines used in 
compiling which require a data flow analysis. 

BACKGROUND OF THE INVENTION 

Compilers are generally used to transform one 
representation of a computer program procedure into 
another representation. Typically, but not exclusively, 
compilers are used to transform a human readable form of 
a program such as source code into a machine readable 
form such as object code. 

One type of compiler is an optimizing compiler 
which includes an optimizer or optimizing module for 
enhancing the performance of the machine readable 
representation of a program. Some optimizing compilers 
are separate from a primary compiler, while others are 
built into a primary compiler to form a multi-pass 
compiler. Both types of compilers may operate either on 
a human readable form, a machine readable form, or any 
intermediate representation between these forms. 

Many optimizing modules of compilers operate 
on intermediate representations of computer programs or 
procedures. Typically a program or procedure being 
translated is broken down into a series of "statements", 
each of which contains zero or more "operands" or "data 
items". A data item may be "defined", meaning that it 
is given a value by the statement, or "used", meaning 
that its value is fed into the computation represented 
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by the statement. For example, the statement "x = y + 
z" defines x and uses y and z. Optimization of a 
program often involves locating individual statements or 
groups of statements which can be eliminated or 
5 rewritten in such a way as to reduce the total number of 
statements in the program or in a particular flow path 
through the program. For example, a complicated 
expression might be computed at two distant points 
within the same procedure. If the variables used in the 
10 expression are not modified to contain different values 
between the first and second computations, the value can 
be computed only once, at the first point in the 
procedure, and saved in a temporary location for use at 
the second point in the procedure, thus avoiding 
15 recomputation at the second point. This particular form 
of optimization is known as "common ( sub) expression 
elimination" . 

The main problem in optimizing a procedure is 
to determine at which points of the procedure various 
20 kinds of information are available. For example, to 
perform common ( sub) expression elimination, it is 
necessary to know at which points the variables used by 
the procedure are modified. To determine such facts, a 
dataflow analysis is performed on the program. 
25 To perform dataflow analysis, possible paths 

of execution through a procedure may be represented by a 
control flow graph (CFG) . Statements may be grouped 
together into basic blocks, which are maximal sequences 
of straight-line code. In other words, there is no way 
30 to branch into or out of a basic block except at the 
beginning or end. A CFG is a graph with one node for 
each basic block in the procedure. The CFG includes an 
arc from block A to block B if it is possible for block 
B to be executed immediately after block A has been 
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executed. In such a case, B is called a "successor" of 
A, and A is called a "predecessor" of B. 

The CFG is generated by a forward pass through 
the procedure to identify basic blocks and transitions 
between basic blocks, and form an ordered representation 
of those blocks and the branches between blocks. One 
well-known approach for ordering the blocks in the CFG 
is to form a "depth first" ordering of the basic blocks 
of the program. This approach is described in Alfred V. 
Aho, Ravi Sethi, and Jeffrey D. Ullman, Comioil ^r.s : 
Principles, Techniques, and Tools . Addison-Wesley, 
copyright 1986, reprinted 1988, which is incorporated by 
reference herein, particularly in sections 10.6 and 
10.9. In a depth first ordering, each basic block is 
assigned a "dfo" number, with the following property: if 
every path from the start of the program to block Y must 
pass through block X, then the dfo number for X is less 
than the dfo number for Y, which is written 
dfo (X) < dfo (Y) . 

After generating a CFG, optimization typically 
involves computing various properties at points of 
interest in the procedure, for example, the properties 
of the statements in each block in the CFG. Often, a 
matrix of binary values (bits) such as is shown in Fig. 
1, is used to identify these properties. In a typical 
approach, there are several rows 10 in the matrix for 
each block in the program, each row 10 representing one 
property of the statements in the block. There is one 
column 12 in the matrix for each property of interest 
during optimization. At each row and column location, 
there is a bit which has either a "1" or a "0" value. 

For example, in the matrix shown in Fig. 1, 
each block B is associated with four rows 10, a row 
in[B] for identifying expressions that are available 
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upon entry to block B, a row out[B] for identifying 
expressions that are available upon exit from block B, a 
row gen[B] for identifying expressions that are 
generated by statements in block B, and a row kill [B] 
5 for identifying expressions whose constituent variables 
are modified by statements in block B. The columns 12 
in the matrix relate to particular expressions, numbered 
1, 2, 3 etc. Thus, the "1" located in the row for in[B 2 ] 
and the column for expression 6, indicates that 
10 expression 6 is available upon entry to block B 2 ; the "0" 
located in the row for infBJ and the column for 
expression 2, indicates that expression 2 is not 
available upon entry to block B x . 

A difficulty that arises with the 
15 representation shown in Fig. 1, is that in practice, 
most of the bits in the matrix are zero (i.e., the 
matrix is "sparse") . In a typical case where the bits 
in the matrix relate to the status of particular 
expressions, the matrix is typically sparse because, 
20 normally, specific expressions are only used or useful 
in a small portion of a procedure. A large, sparse bit 
matrix not only consumes large quantities of space, but 
also requires a large amount of time to repeatedly scan 
in the manner needed for complex dataflow analysis. 

25 
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SUMMARY OF THE INVENTION 



The invention addresses these and other 
problems associated with the prior art by utilizing a 
skip-list data structure for representing properties of 
points of interest in a procedure. When sets thus 
represented are sparse, this data structure 
substantially reduces the storage space required for 
storing such properties, and can be scanned much more 
rapidly than the corresponding sparse bit matrix. 

Specifically, in one aspect, the invention 
features a method of storing properties associated with 
a computer procedure, in a linked list of data storage 
nodes. Each of the nodes in the linked list stores a 
property of the computer procedure, and the nodes are 
ordered in accordance with a predetermined property 
order. When a new property is associated with the 
computer procedure, a data storage node for the property 
is generated and added to the linked list. Each data 
storage node includes a data storage space for storing 
an identifier of a property, and at least a first 
pointer storage space for storing a pointer identifying 
a location of an other data storage node. The data 
storage space of each data storage node stores an 
identifier of the new property for which the node was 
allocated. The first pointer storage space of each data 
storage node stores a pointer identifying a location of 
an other data storage node, specifically, the pointer 
identifies a subsequent node in the linked list. 

In disclosed specific embodiments, the linked 
list includes data storage nodes of either a first 
smaller size or a second larger size; the sizes of the 
data storage nodes are randomly selected. The data 
storage nodes of the second larger size include a second 



pointer storage space for storing a pointer identifying 
a location of an other data storage node. The pointer 
stored into the second pointer storage space identifies 
the location of the next subsequent data storage node of 
the second larger size. 

In the disclosed specific embodiment, when a 
property is disassociated with the computer procedure, 
the associated data storage node becomes dispensable and 
is deleted. 

In accordance with a second aspect of the 
invention, a special procedure is used to initialize 
skip lists, prior to performing data flow analysis, to 
ensure that the skip list structure is not used in an 
inefficient manner as a result of initialization, 
particularly where an iterative dataflow analysis 
technique is used in which iterations of the analysis do 
not increase the membership of the property sets. 
Specifically, each basic block of a computer procedure 
is associated with sets of entry properties and exit 
properties, as well as property modifications caused by 
the basic block. The basic blocks are then selected and 
initialized in a predetermined order. The entry 
properties of the currently selected basic block, are 
copied from exit properties of a previously selected and 
processed basic block. Next, the exit properties for 
the currently selected basic block, are computed from 
the entry properties and the property modifications 
associated with the currently selected basic block. 

In the disclosed specific embodiment, the 
entry and exit properties are sets of expressions 
available upon entry and exit from the basic block, and 
the property modifications are expressions generated and 
killed by the basic block. Also, to further improve the 
initialization of sets for a basic block, any 
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expressions not found in the sets of expressions 
available upon exit from all previously selected and 
processed control flow predecessors of the currently 
selected basic block, are removed from the set of 
expressions available upon entry to the. currently 
selected basic block. 

In a third aspect, the invention features the 
structure of the linked list per se. Specifically, this 
structure includes a plurality of data storage nodes, 
each data storage node including a data storage space 
for storing an identifier of a property and at least a 
first pointer storage space for storing a pointer 
identifying a location of an other data storage node. 
The data storage space of each node stores an identifier 
of a first property for which the node was allocated, 
and the first pointer storage space of each data storage 
node stores a pointer identifying a location of an other 
data storage node, the other data storage node storing a 
second property associated with the computer procedure 
that is subsequent to the first property in a 
predetermined property order. 

In a further aspect, the invention features a 
computer system for compiling a computer procedure into 
a machine-readable representation, comprising an 
optimizer that optimizes the computer procedure into an 
optimized representation by storing and manipulating 
properties associated with the computer procedure in 
accordance with the aspects described above. 

In still a further aspect, the invention 
features a program product configured to store 
properties associated with a computer procedure, in a 
linked list of data storage nodes in accordance with the 
aspects described above, and a signal bearing media 
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bearing the program, which may be a transmission type 
media or a recordable media. 

These and other advantages and features, which 
characterize the invention, are set forth in the claims 
5 annexed hereto and forming a further part hereof. 

However, for a better understanding of the invention, 
and the advantages and objectives attained by its use, 
reference should be made to the Drawing, and to the 
accompanying descriptive matter, in which there is 
10 described embodiments of the invention. 
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BRIEF DESCRIPTION OF THE DRAWINGS 



FIGURE 1 is a data structure diagram 
illustrating the use in the prior art of a bit matrix to 
represent properties of points of interest in a computer 
procedure . 

FIGURE 2 is a block diagram of a computer 
system consistent with the invention. 

FIGURES 3A and 3B are data structure diagrams 
illustrating skip list data structures representing the 
information found in two rows of the bit matrix of Fig. 
2. 

FIGURE 4 is a flow chart of specific 
operations performed as part of a data flow analysis 
using skip list data structures such as shown in Figs. 
3A and 3B . 

FIGURE 5 is a flow chart of specific 
operations performed as part of initializing skip list 
data structures such as shown in Fig. 3A and 3B, for the 
data flow analysis process of Fig. 4. 

FIGURE 6A is a flow chart of specific 
operations performed as part of simultaneously scanning 
two skip lists and removing from one list any nodes not 
in the other, as part of the data flow analysis and 
initialization processes of Figs. 4 and 5. 

FIGURE 6B is a flow chart of specific 
operations performed as part of simultaneously scanning 
three skip lists and removing from one list any nodes 
not in one of the others, as part of the data flow 
analysis process of Fig. 4. 

FIGURE 6C is a flow chart of specific 
operations performed as part of simultaneously scanning 
three skip lists and adding to one list any nodes 
meeting certain conditions in the other two lists. 
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DETAILED DESCRIPTION 



Prior to discussing the operation of 
embodiments of the invention, a brief overview 
discussion of compilers and compiling techniques is 
provided herein. 

Overview of Compilers 

Compilers and the like are generally known in 
the art. One known type of compiler is a multi-pass 
optimizing compiler, which includes a front-end module 
for converting source code into an intermediate 
representation, and a back-end module which takes the 
intermediate representation and generates object code. 

The front-end module of a multi-pass 
optimizing compiler typically includes a lexicographic 
analyzer which identifies tokens or keywords in the 
source code, and a parser which analyzes the program 
statement by statement. The parser typically uses a 
context-free grammar to determine if program statements 
satisfy a set of grammar rules, and builds constructs. 
The parser then generates an intermediate representation 
using an intermediate code generator. 

The back-end module of a multi-pass optimizing 
compiler typically includes an optimizer or optimizing 
module which operates on the intermediate representation 
to generate a revised or optimized intermediate 
representation. Several different optimizations may be 
performed, including but not limited to local 
optimizations such as value numbering, elimination of 
redundant computations, register allocation and 
assignment, instruction scheduling to match specific 
machine characteristics, moving invariant code out of 
loops, strength reduction, induction variable 
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elimination, and copy propagation, among others. The 
back-end module also includes a final code generator to 
generate the object code from the revised intermediate 
representation. 

A compiler may reside within the memory of the 
computer system upon which the object code generated by 
the compiler is executed. Alternatively, a compiler may 
be a cross-compiler which resides on one computer system 
to generate object code for execution on another 
computer system. Either type of compiler may be used 
consistent with the invention. 

One suitable back-end module for use with the 
invention is an AS/400 optimizing translator supplied 
with an AS/400 minicomputer, which is a common back-end 
module of an optimizing compiler. This product may be 
used with a front-end module such as the ILE C Compiler 
available from IBM, among others. It will be 
appreciated that other compilers are suitable for 
different languages and/or different hardware platforms, 
and may also be used in the alternative-. 

Computer System 

Turning to the Drawing, wherein like numbers 
denote like parts throughout the several views, Fig. 2 
shows a block diagram of a computer system 20 consistent 
with the invention. Computer system 2 0 is an IBM AS/4 00 
minicomputer. However, those skilled in the art will 
appreciate that the mechanisms and apparatus consistent 
with the invention apply equally to any computer system, 
regardless of whether the computer system is a 
complicated multi-user computing apparatus or a single 
user device such as a personal computer or workstation. 
As shown in Fig. 2, computer system 20 includes a main 
or central processing unit (CPU) 22 connected through a 

Page 11 of 39 
IBM R0997-127 

WHSE IBM/09 
Patent Application 



12 

system bus 21 to a main memory 30, a memory controller 
24, an auxiliary storage interface 26, and a terminal 
interface 28. 

Auxiliary storage interface 26 allows computer 
5 system 20 to store and retrieve information from 

auxiliary storage such as magnetic disk, magnetic tape 
or optical storage devices. Memory controller 24, 
through use of a processor separate from CPU 22, moves 
information between main memory 30, auxiliary storage 

10 interface 26, and CPU 22. While for the purposes of 

explanation, memory controller 24 is shown as a separate 
entity, those skilled in the art understand that, in 
practice, portions of the function provided by memory 
controller 24 may actually reside in the circuitry 

15 associated with CPU 22 and main memory 30. Further, 
while memory controller 24 of the embodiment is 
described as having responsibility for moving requested 
information between main memory 30, auxiliary storage 
interface 26 and CPU 22, those skilled in the art will 

20 appreciate that the mechanisms of the present invention 
apply equally to any storage configuration, regardless 
of the number and type of the storage entities involved. 

Terminal interface 28 allows system 
administrators and computer programmers to communicate 

25 with computer system 20, normally through programmable 
workstations. Although the system depicted in Fig. 2 
contains only a single main CPU and a single system bus, 
it will be understood that the invention also applies to 
computer systems having multiple CPUs and buses. 

30 Main memory 30 is shown storing a compiler 40 

(comprising analyzer 42, parser 44, optimizer 46 and 
code generator 48) and operating system 32. Memory 30 
also includes a workspace 50, which is shown storing a 
computer program in various stages of compilation, 
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13 

including a source code representation 52, an 
intermediate representation 54, an optimized 
representation 56 and object code 58. However, it 
should be understood that main memory 30 will not 
5 necessarily always contain all parts of all mechanisms 
shown. For example, portions of compiler 4 0 and 
operating system 32 will typically be loaded into caches 
in CPU 22 to execute, while other files may well be 
stored on magnetic or optical disk storage devices. 

10 Moreover, the various representations 52-58 of a 

computer program may not be resident in the main memory 
at the same time. Various representations may also be 
created by modifying a prior representation in situ. In 
addition, as discussed above, the front-end and back-end 

15 modules in some systems may be separate programs. 

It will be appreciated that computer system 20 
is merely an example of one system upon which the 
routines may execute. Further, as innumerable 
alternative system designs may be used, principles of 

20 the present invention are not limited to any particular 
configuration shown herein. 

In general, the routines executed to implement 
the illustrated embodiments of the invention, whether 
implemented as part of an operating system or a specific 

25 application, program, object, module or sequence of 
instructions will be referred to herein as "computer 
programs". The computer programs typically comprise 
instructions which, when read and executed by one or 
more processors in the devices or systems in a computer 

30 system consistent with the invention, cause those 

devices or systems to perform the steps necessary to 
execute steps or generate elements embodying the various 
aspects of the present invention. Moreover, while the 
invention has and hereinafter will be described in the 
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14 

context of fully functioning computer systems, those 
skilled in the art will appreciate that the various 
embodiments of the invention are capable of being 
distributed as a program product in a variety of forms, 
5 and that the invention applies equally regardless of the 
particular type of signal bearing media used to actually 
carry out the distribution. Examples of signal bearing 
media include but are not limited to recordable type 
media such as volatile and non-volatile memory devices, 

10 floppy disks, hard disk drives, CD-ROM's, DVD's, 

magnetic tape, etc., and transmission type media such 
as digital and analog communications links. 
Use of Computer System 

Referring now to Figs. 3A and 3B, an 

15 explanation can be provided of the skip list data 

structure of the kind used in accordance with principles 
of the present invention. The skip list illustrated in 
Fig. 3A stores the set of properties in[B.J, 
specifically, the set of expressions available upon 

20 entry to basic block B x of the computer procedure 
currently undergoing compilation. The skip list 
illustrated in Fig. 3B stores the set of properties 
in[B 2 ], specifically, the expressions available upon 
entry to basic block B 2 of the computer procedure 

25 currently undergoing compilation. 

As seen in Figs. 3A and 3B, each skip list 
begins with an initial data storage node 60a and 60b, 
and includes a number of data storage nodes 62a/62b, 
64a/64b, 66a/66b, 68a/68b, 70a/70b, 72a/72b etc. Each 

30 node includes a storage space 84 for storing a key for a 
property of the computer procedure. For example, the 
skip list shown in Fig. 3A, which identifies those 
expressions which are available upon entry to basic 
block B lf includes nodes storing the keys "3", "6", "7", 
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15 

"9", "12", "17", "19", "21", "25" and "26", thus 
indicating that the expressions associated with these 
keys are available upon entry to basic block B x . 

The nodes in a skip list are one of a number 
5 of sizes. Nodes of the smallest size (which will be 
referred to as "size 1"), such as nodes 62a and 62b, 
include only (a.) a storage space 84 for storing a key 
of a property of the computer procedure, and (b.) a 
storage space 8 6 for storing a "level 1" pointer 

10 (computer memory address) indicating the location of the 
next node in the skip list. Nodes of the next larger 
size (which will be referred to as "size 2") , such as 
nodes 68a and 64b, include storage spaces 84 and 86, and 
in addition, include a storage space 88 for storing a 

15 "level 2" pointer indicating the location of a 

subsequent node in the skip list. Specifically, the 
level 2 pointer in storage space 88 indicates the 
location of the next node in the skip list of at least 
size 2. 

20 The skip lists illustrated in Figs. 3A and 3B 

also include "size 3" nodes (e.g., nodes 78a and 70b) 
which include, in addition to the storage spaces 84, 86 
and 88 found in a size 2 node, an additional pointer 
storage space 90 for storing a "level 3" pointer to the 

25 next node in the skip list of at least size 3. 

Furthermore, the skip list of Fig. 3A includes a "size 
4" node 64a, which includes, in addition to the storage 
spaces 84, 86, 88 and 90 found in a size 3 node, a 
"level 4" additional pointer storage space 92 for 

30 storing a pointer to the next node in the skip list of 
at least size 4. 

The maximum number of levels for skip list 
nodes is arbitrary, and may, in one embodiment, be 
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selected based on the criteria identified by the above- 
referenced paper by Pugh. 

Skip lists terminate at a "NIL" node 82. The 
level 1 pointer of the last node in the skip list points 
5 to the NIL node 82. Furthermore, the level 2 pointer of 
the last node in the skip list of at least size 2, 
points to the NIL node. The level 3 pointer of the last 
node in the skip list of at least size 3, points to the 
NIL node. The level 4 pointer of the last node in the 

10 skip list of at least size 4, points to the NIL node. 

The keys for properties stored in a skip list, 
are generated so that the keys can be sorted in a 
predetermined, defined order, in order to simplify the 
process of performing a membership test of a set 

15 represented by a skip list. In the example below where 
the skip lists store sets of expressions related to the 
computer procedure, a set of keys with a predetermined 
ordering can be created, for example, by numbering each 
expression as it is encountered during a scan of the 

20 entire program in depth-first order. Then, these 

numbers can be used as the keys in the skip list data 
storage fields 84. To facilitate processing, the key 
for the NIL node is the highest number that can be 
represented in the data format used to store the keys, 

25 so that the key of the NIL node is always numerically 
greater than any other key in a skip list. 

Specific details on the use of skip lists, 
including the process for scanning a skip list to 
perform a membership test, methods for randomly 

30 selecting the size of skip list nodes, and further 

descriptions of the advantages of skip lists, can be 
found in William Pugh, Skip Lists: A Probabilistic 
Alternative to Balanced Trees , Communications of the 
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ACM, June 1990, Vol. 33, Number 6, Pages 668 to 676, 
which is hereby incorporated herein in its entirety. 

Referring now to Fig. 4, the process for 
performing data flow analysis using the data structures 
discussed above can be more completely explained. 
Specifically, for initialization, a loop 100, 102, 104 
is performed. In this loop, for each block B in the CFG 
(step 100), the properties gen[B] and kill [B] are 
computed for the block (step 102), and the next block is 
selected (step 104), until all blocks have been 
processed. This step involves adding a node to the skip 
lists storing gen[B] and kill[B] for each expression to 
be included in those sets. Details on the process for 
adding a node to a skip list can be found from the 
above-referenced paper by Pugh. Details on methods for 
identifying expressions to be included in gen[B] and 
kill[B] as part of step 102 are described in the above- 
referenced section of the Aho et al. book. 

After computing the properties gen[B] and 
kill [B] , processing continues to step 106, in which an 
initial approximation of the sets in[B] and out[B] are 
computed for each basic block B. Details on this 
initialization process are set forth below with 
reference to Fig. 5. 

After the appropriate initialization has been 
performed, a loop including steps 108, 110, 112, 114, 
116, 118, 120 and 122 is performed to make a dataflow 
analysis on the program and specifically the basic 
blocks thereof, to obtain final representations of in[B] 
and outtB], which can then be used in subsequent program 
optimizations such as common { sub } expression 
elimination. Each of steps 110, 112, 114, 116 and 118 
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is performed for each block B in the CFG (step 108) 
until all blocks have been processed (step 120) . 

For each block, taken in depth-first order, a 
second loop including 110, 112 and 114 is performed, for 
5 the purpose of eliminating expressions that are not 
available upon exit of the predecessors of that block. 
Specifically, for each predecessor P of the currently 
selected block B (step 110) , the skip lists storing the 
sets in[B] and out[P] are "walked", or scanned, 

10 simultaneously, removing from in[B] any expressions not 
found in out[P] (step 112) . Further details on the 
operations performed in connection with step 112 are 
provided below with reference to Fig. 6A. Step 112 is 
repeated for each predecessor P until all predecessors 

15 have been processed (step 114) . 

After completing this loop for each 
predecessor P of the currently selected block B, in step 
116, it is determined whether in[B] was changed as a 
result of steps 110, 112 and 116. (Note that the only 

20 change that might be made to in[B] is the removal of one 
or more members from the set of expressions represented 
by in[B].) if there have been changes to in[B], then 
processing proceeds to step 118, in which out[B] for the 
same block is updated to reflect the changes to in[B]. 

25 Specifically, the skip lists which store a 

representation of in[B], out[B], and gen[B], are 
simultaneously "walked", or scanned, to identify those 
nodes of out[B] which should be removed as a result of 
the changes to in[B] . Further specific details on this 

30 process will be provided in connection with Fig. 6B . 

After out[B] has been updated' through step 
118, or immediately after step 116 if in[B] has not 
changed, the next block B in the CFG is selected, and 
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processing returns to step 110. This process repeats 
until every block B in the CFG has been processed, at 
which time, processing proceeds to step 122. In step 
122, it is determined whether any of the inf.] or out[.] 
sets, for any of the blocks in the CFG, were changed as 
a result of the previous pass through the blocks in the 
steps of loop 108, 110, 112, 114, 116, 118, 120. If so, 
then another pass through the blocks must be performed, 
and processing returns to step 108. However, if none of 
the in[.] and out[.] sets have changed in the previous 
pass, then correct final versions of the in[.] and 
out[.] sets have been achieved, and the data flow 
analysis is completed. Subsequently, the sets for in[.] 
and out[.] can be used in various optimizations, such as 
common sub-expression elimination. (Other methods may be 
used to select blocks to process and the order of 
processing, for example, a "worklist" approach could be 
used, in which a block is only processed if any of its 
predecessors changed since the last pass.) 

Referring now to Fig. 5, the process for 
calculating an initial approximation can be described in 
greater detail. As is noted above, the process for 
updating the in [ . ] and out[.] sets illustrated in Fig. 
4, updates these sets by removing those members which 
are determined to be unavailable upon entry and upon 
exit from specific blocks. Typically, a dataflow 
analysis process of this kind is initialized by 
including all possible expressions into in[B] and out[B] 
for every block B in the CFG, and then using a dataflow 
analysis to remove expressions in an iterative manner 
analogous to that shown in Fig. 4, until final resulting 
sets of expressions are obtained. 
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While this method of initializing the sets 
in [ . ] and out[.] is effective when these sets are 
represented as bit vectors as is shown in Fig. 1, where 
densely populated sets can be efficiently represented, 
5 this method is not effective when these sets are 

represented as skip lists such as is shown in Figs. 3A 
and 3B, because skip lists are intended for use in 
representing sparsely populated sets and are not 
efficient in representing densely populated sets. 

10 Therefore, in accordance with principles of 

the present invention, a novel methodology is used to 
initialize the sets in [ . ] and out [ . ] . This methodology 
is based on the recognition that (1) the dataflow 
analysis conducted in accordance with Fig. 4 only 

15 reduces the number of members in these sets, and (2) it 
is only necessary that in [ . ] and out[.] initially 
include all members that might be included after 
dataflow analysis. If a simplified analysis of the 
blocks B in the CFG can identify all of the members that 

20 might possibly be included in the in [ . ] and out[.] sets 
after a complete data flow analysis, then only those 
members that might possibly be included need be included 
when the in [ . ] and out[.] sets are initialized. 

To perform an appropriate initialization, 

25 therefore, in accordance with principles of the present 
invention, a single forward pass is made through the 
CFG, initializing the inf.] and out [ . ] sets by assuming 
that, in any given block B, all expressions available on 
exit from all control flow predecessors of block B that 

30 appear earlier in the depth-first order than block B, 
will be available in block B. 

Specifically, in a first step 130, sets are 
generated for an initial block Init, which by definition 
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precedes the first block in the CFG. The sets inflnitj 
and out[Init] for the block Init are initialized to be 
empty. Then, beginning at step 132, a loop, including 
steps 134 through 142 is performed for each block in the 
5 CFG, taken in depth-first order. 

In the first step 134, the set in[B] for the 
current block B is copied from out[P] for a predecessor 
block P of the block B (where the predecessor P 
satisfies the requirement that df o (P) <df o (B) ) . This 

10 makes a first approximation for in[B], based on 

knowledge that no expression is available upon entry to 
block B unless it is available on exit from every 
predecessor P of B. Thus, the first approximation of 
in[B] formed by this step will include every expression 

15 that might be in the final set for in[B] after a 
complete dataflow analysis. 

After computing this initial approximation, 
beginning at step 136, a loop, including step 138, is 
performed for each remaining control flow predecessor P 

20 of block B which satisfies the requirement that 

dfo (P) <dfo (B) . Specifically, any expressions in the 
approximation for in[B] which are not available upon 
exit from any qualifying predecessor, are removed from 
in[B]. Specifically, in step 138, the skip lists 

25 representing in[B] and out[B] are simultaneously 

"walked", or scanned, to remove from in[B] any nodes not 
found in out[P] . Specific details of the operations 
involved in step 138 are detailed below' with reference 
to Fig. 6A. 

30 At step 140, step 138 is repeated until every 

predecessor P has been considered. The resulting 
iterative loop improves the approximation for in[B] 
formed in step 134 by limiting in[B] to those 
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expressions which are available upon exit from all 
predecessor blocks P appearing prior to B in the depth- 
first ordering. 

After all predecessors P for a block have been 
5 processed through step 138, processing continues to step 
142, at which an initial approximation of out[B] is 
generated. Specifically, out[B] is approximated from 
in[B] by adding to in[B] all expressions generated by 
block B (as represented by the previously-computed set 

10 gen[B]), and removing from in[B] any expressions killed 
by block B (as represented by the previously-computed 
set kill [B] ) . To make this approximation, the skip 
lists for in[B], kill[B] and gen[B] are "walked", or 
scanned, simultaneously, adding to out[B] any nodes 

15 found in gen[B] and any nodes found in in[B] but not in 
kill [B] . Details on this operation are described below 
in connection with Fig. 6C . 

After step 142, at step 144, control is 
returned to step 134 to process the next block B in the 

20 CFG taken in depth-first order, until all blocks B have 
been so processed. Once all blocks B have been 
processed, the sets in[B] and out[B] are appropriately 
initialized so that these sets have an initial 
approximation of their final contents which contains all 

25 of the members that might be included after a complete 

dataflow analysis according to Fig. 4, without including 
all expressions into these sets and thus incurring 
inefficient storage. 

Referring now to Fig. 6A, details of the 

30 operations performed in steps 112 (Fig. 4) and 138 (Fig. 
5) to walk the skip lists of in[B] and out[P] 
simultaneously, removing from in[B] any nodes not found 
in out[P] . As a first step in this process, in step 
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148, the first nodes in the skip lists for in[B] and 
out[P] are selected. Next, in step 150 it is determined 
whether the end of the skip list representing in[B] has 
been reached, which is determined by determining whether 
the key of the current node of the in[B] skip list is 
the highest integer, which is the key of the "NIL" node 
82. If so, no further processing is needed and the 
operation is done. 

However, if in step 150 the end of the in[B] 
skip list has not been reached, then, in step 152, it is 
determined whether the key in the current node in the 
out[P] skip list, follows the key in the current node in 
the in[B] skip list in the predetermined key order. If 
so, this indicates that the expression represented by 
the current node in the in[B] skip list, is not present 
in the out[P] skip list. (Note that if the end of the 
out[P] skip list has been reached, then the key of the 
current node in the out[P] skip list is greater than any 
key in the in[B] skip list, and thus the current and all 
subsequent nodes in the in[B] skip list should be 
removed and are removed by repetitions of steps 154, 
156, 150 and 152, until the end of the in[B] skip list 
is reached.) When the current node in the in[B] skip 
list is not present in the out[P] skip list, processing 
proceeds to step 154, where the current node in the 
in[B] skip list is deleted. (Details on the process for 
deleting a node from a skip list can be found in the 
above-referenced paper by Pugh.) As part of deleting 
the current node in in[B], a flag may be set, so that it 
can later be determined {in step 116, Fig. 4) that in[B] 
was changed. After deleting the current node in the 
in[B] skip list, in step 156, the next node in the in[B] 
skip list is selected (by following the level 1 pointer 
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of the current node to the first subsequent node) , and 
processing returns to step 150. 

If, in step 152, the key in the current node 
in the out[P] skip list, does not follow the key in the 
current node in the in[B] skip list in the predetermined 
key order, then processing continues to step 158. In 
step 158, it is determined whether the key in the 
current node in the in[B] skip list is equal to the key 
in the current node in the out[P] skip list. If not, 
then the key in the current node in the out[P] skip list 
must precede the key in the current node in the in[B] 
skip list, and accordingly, processing proceeds to step 
162, where the next expression in the out[P] skip list 
is selected (by following the level 1 pointer of the 
current node) , after which processing returns to step 
150. 

If, however, in step 158, it is determined 
that the keys in the current nodes in the in[B] and 
out[P] skip lists are equal, then the expression 
represented by the current node in the in[B] skip list 
is in out[P], in which case, the next expression in the 
in[B] and out[P] skip lists should be evaluated. Thus, 
in this case, processing proceeds to step 160. In step 
160, the next node in the in[B] list is selected (by 
following the level 1 pointer of the current node) , and 
then processing proceeds to step 162, where the next 
node in the out[P] skip list is selected. 

Referring now to Fig. 6B, the processing of 
step 118 (Fig. 4) can be described in more detail. In 
this step, the skip lists for in[B], gen[B] and out[B] 
are walked together, removing from out[B] any nodes not 
in either of in[B] or gen[B]. 
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To begin this process, in step 170 the first 
nodes in the in[B], gen[B] and out[B] nodes are 
selected. Next, in step 172, the current node in the 
out[B] list is evaluated, to determine whether the end 
5 of the out[B] skip list has been reached (by comparing 
the key for the current out[B] node to the largest 
possible integer, which is used in the NIL node) . If 
the end of the out[B] skip list has been reached, then 
processing is done. 

10 If the end of the out[B] skip list has not 

been reached, however, then processing proceeds to step 
174, in which the key of the current node in the gen[B] 
skip list is compared to the key of the current node of 
the out[B] skip list. If the key of the current node in 

15 the out[B] skip list follows the key of. the current node 
in the gen[B] skip list in the predetermined key 
ordering, then processing proceeds to step 176, and the 
next node in the gen[B] skip list is selected (by 
following the level 1 pointer of the current node in the 

20 gen[B] skip list), and then returns to step 174. This 
small loop including steps 174 and 176 continues to 
select subsequent elements in the gen[B] skip list until 
the key of the current element in gen[B] is either equal 
to or follows the key of the current node in the out[B] 

25 skip list. Once the key of the current node in the 

gen[B] skip list is either equal to or follows the key 
in the current node in the out[B] skip list, processing 
proceeds to step 178. 

In step 178, the key of the current node in 

30 the in[B] skip list is compared to the key of the 

current node of the out[B} skip list. If the key of the 
current node in the out[B] skip list follows the key of 
the current node in the in[B] skip list in the 
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predetermined key ordering, then processing proceeds to 
step 180, and the next node in the in[B] skip list is 
selected (by following the level 1 pointer of the 
current node in the in[B] skip list), and then returns 
5 to step 178. This small loop including, steps 178 and 
180 continues to select subsequent elements in the in[B] 
skip list until the key of the current element in in[B] 
is either equal to or follows the key of the current 
node in the out[B] skip list. Once the key of the 

10 current node in the in[B] skip list is either equal to 
or follows the key in the current node in the out[B] 
skip list, processing proceeds directly to step 182. 

In step 182, the key of the current node in 
the out[B] skip list is compared to the keys of the 

15 current nodes in the gen[B] and in[B] skip lists. If 
the key of the current node in the out[B] skip list is 
not equal to either of the keys in the current nodes of 
the gen[B] or in[B] skip lists, then it can be 
determined that the expression represented by the key of 

20 the current node of the out[B] skip list is not in 
either gen[B] or in[B]; therefore, in this case, 
processing proceeds to step 184, and the current node in 
the out[B] skip list is deleted (details on the process 
for deleting a node from a skip list can be found in the 

25 above-referenced paper by Pugh) . After step 184, the 

next element in the out[B] skip list is selected in step 
186, and processing returns to step 172. However, if in 
step 182 the key of the current node in the out[B] skip 
list is equal to either or both of the keys in the 

30 current nodes of the gen[B] or in[B] skip lists, then 
the expression represented by the key of the current 
node of the out[B] skip list is in one of gen[B] or 
in[B]; therefore, in this case, processing proceeds to 
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directly to step 186, and the next node in out[B] is 
selected, after which, processing returns to step 172. 

Referring now to Fig. 6C, the processing of 
step 142 (Fig. 5) can be described in more detail. In 
5 this step, the skip lists for in[B], gen[B] and kill [B] 
are walked together, adding to out[B] any nodes found in 
in[B] but not in kill [B] . 

To begin this process, in step 190 the first 
nodes of in[B], gen[B] and kill [B] are selected. Then, 

10 in step 192, the keys in the current nodes in in[B] and 
gen[B] are evaluated to determine whether these nodes 
are the NIL node (having the highest integer number as a 
key) . If both of the current nodes are the NIL node, 
then processing is done. However, if this is not the 

15 case, then processing proceeds to step 194, in which the 
key of the current node in the gen[B] skip list is 
compared to the key in the current node in the in[B] 
skip list. If the key in the current node in the in[B] 
skip list follows the key in the current node in the 

20 gen[B] skip list, this indicates that there is at least 
one expression generated in the current block B, 
represented by a node in gen[B], that is not in in[B]; 
therefore, in this case, processing proceeds to step 
196, where the key in the current gen[B] node is added 

25 to the end of out[B], thus including the generated 

expression in the set of expressions available upon exit 
from block B; thereafter, in step 198, the next node in 
gentB] is selected, by following the level 1 pointer of 
the current node in gen[B], and then processing returns 

30 to step 194. This loop including steps 194, 196 and 198 
is repeated until all of the elements which appear in 
gen[B] and are prior to the current element in in[B], 
have been added to the end of out[B], (Note that 
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because the present routine is performed during 
initialization of out[B], the set of expressions 
represented by out[B] is initially empty, and so it is 
not necessary to insert nodes into any middle points in 
5 out[B] . This means that the process of adding nodes can 
be fairly rapid. The last node of each size is stored 
in memory as nodes are added. Then, when adding a new- 
node to the end of the list of size n, the last node of 
each height less than or equal to n has its pointer (s) 

10 updated to point to the new node. The new node then 
becomes the last node of size n.) 

Eventually, by proceeding to subsequent nodes 
in the gen[B] skip list, eventually a node is reached in 
gen[B] which has a key that does not precede the key of 

15 the current in[B] node in the predetermined key order. 
In this situation, processing proceeds from step 194 to 
step 200, where it is determined whether the current 
node in the in[B] skip list is the NIL node. If so, 
then the end of the in[B] skip list has been reached, 

20 and processing returns directly to step 192. 

However, if the end of the in[B] skip list has 
not been reached, then processing proceeds to step 202, 
in which the key of the current node of the in[B] skip 
list is compared to the key of the current node of the 

25 kill [B] skip list. The purpose of this and the 

following steps 204 and 206 is to determine whether the 
key of the current node of the in[B] skip list should be 
added to the out[B] skip list, and to take appropriate 
actions. If the key of the current node of the kill [B] 

30 skip list precedes the key of the current node of the 
in[B] skip list in the predetermined key order, then 
processing proceeds to step 204, in which the next node 
in the kill [B] skip list is selected (by following the 
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level 1 pointer of the current node of the kill[B] skip 
list), and processing returns to step 2D2. 

This loop between steps 202 and 204 is 
continued until the key of the current node in the 
5 kill [B] skip list does not precede the key of the 

current node of the in[B] skip list. Once this occurs, 
processing proceeds from step 202 to step 206, in which 
it is determined whether the key of the current node in 
the kill[B] skip list is equal to the key of the current 

10 node in the in[B] skip list. If so, then the key found 
in in[B] is also found in kill [B] ; in such a situation, 
the key should only be added to out[B] if the key also 
appears in gen[B]. Stated another way, an expression 
which is available upon entry to a block, but is killed 

15 by that block, will not be available upon exit from the 
block unless the expression is also generated by the 
block. Accordingly, in this situation, processing 
proceeds to step 212, in which the key of the current 
node in the in[B] skip list is compared to the key of 

20 the current node in the gen[B] skip list. If the keys 
are equal, then the expression represented by the 
current key in the current node of the in[B] skip list 
is generated by the current block B, and accordingly 
processing continues to step 214, in which a node with 

25 this key is added to the end of the out[B] skip list. 
Thereafter, the next node of the gen[B] skip list is 
selected in step 216, and the next node of the in[B] 
skip list is selected in step 218, and processing 
returns to step 192. 

30 If, in step 212, it is determined that the key 

of the current node in the in[B] skip list is not equal 
to the key of the current node in the gen[B] skip list, 
then the expression represented by the key of the in[B] 
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node is not generated by the current block B. 
Accordingly, in this case, processing proceeds directly 
to step 218 to select the next node in in[B], and then 
returns to step 192. 
5 Furthermore, if in step 206, it is determined 

that the key of the current node in the' in[B] skip list 
is not equal to the key of the current node in the 
kill[B] skip list, then it is determined that the 
expression represented by the key in the current node of 

10 the in[B] skip list is available upon entry to block B 
and not killed by block B. In this case, processing 
proceeds to step 208, in which a node is added to the 
end of out[B] containing the key of the current node of 
in[B], thus effectively adding this expression to 

15 out[B]. Next, in step 210, the key of the current node 
of in[B] is compared to the key of the current node of 
gen[B]. If the keys are equal, then processing proceeds 
to step 216 to select the next node in gen[B] and then 
(in step 218) the next node in in[B] before returning to 

20 step 172. However, if the keys are unequal, then the 

key of the current node of gen[B] must follow the key of 
the current node in in[B], and accordingly processing 
proceeds directly to step 218 so that the next node is 
selected in only in[B] before proceeding to step 192. 

25 Following the foregoing procedures, a skip 

list representation can be used for the sets in a 
dataflow analysis performed by a compiler, and in 
subsequent optimizations, substantially improving the 
efficiency of data storage as compared to the use of bit 

30 vectors. 

It will therefore be appreciated that the 
invention provides significant advantages in terms of 
optimization of computer procedures during compilation, 
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resulting in more efficient compilation. It will also 
be appreciated, that numerous modifications may be made 
to the disclosed embodiments consistent with the 
invention. For example, in some programs each block in 
5 the CFG may kill a large volume of the universe of 
expressions. If this is the case, then it may be 
advantageous to represent the sets kill[B] as bit 
vectors in the manner shown in Fig. 1, rather than as 
skip lists. If this approach is taken, then steps 202, 

10 204 and 206 of Fig. 6C would be replaced with a single 
step performing a comparison of the key of the current 
node of in[B] to the bit vector representing kill [B] to 
determine whether the key of the current node of in[B] 
is a member of kill [B] . If so, processing would proceed 

15 to step 212, and if not, processing would proceed to 
step 208. Other modifications may be made to the 
disclosed embodiments without departing from the spirit 
and scope of the invention. Therefore, the invention 
lies in the claims hereinafter appended. 
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CLAIMS 

What is claimed is: 

1. A method of storing dataflow analysis properties 
associated with a computer procedure, in a linked list 
of data storage nodes, the method comprising generating 
a data storage node for a new property to be associated 
with the computer procedure by 

allocating a data storage node, the data storage 
node including a data storage space for storing an 
identifier of a property and at least a first pointer 
storage space for storing a pointer identifying a 
location of an other data storage node, 

storing into the data storage space of the 
allocated data storage node, an identifier of the new 
property for which the node was allocated, and 

storing into the first pointer storage space of the 
data storage node, a pointer identifying a location of 
an other data storage node, the other data storage node 
storing an existing property associated with the 
computer procedure that is subsequent to the new 
property in a predetermined property order. 

2. The method of claim 1 adapted to preparing a 
linked list including data storage nodes of either a 
first smaller size or a second larger size, wherein data 
storage nodes of the second larger size include a second 
pointer storage space for storing a pointer identifying 
a location of an other data storage node, wherein the 
allocating step further comprises 

selecting a size for the data storage node. 

3. The method of claim 2 wherein the size for the 
data storage node is randomly selected. 
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4. The method of claim 1 wherein the pointer stored 
into the first pointer storage space identifies a 
location of a data storage node storing- an existing 
property associated with the computer procedure that is 
next subsequent to the new property in the predetermined 
property order. 

5. The method of claim 2 wherein the pointer stored 
into the first pointer storage space identifies a 
location of a subsequent data storage node of the first 
smaller size that is storing an existing property 
associated with the computer procedure that is next 
subsequent to the new property in the predetermined 
property order. 

6. The method of claim 5 wherein the pointer stored 
into the second pointer storage space identifies a 
location of a subsequent data storage node of the second 
larger size that is storing an existing property 
associated with the computer procedure that, as compared 
to all existing properties associated with the computer 
procedure that are stored in data storage nodes of the 
second larger size, is next subsequent to the new 
property in the predetermined property order. 

7. The method of claim 1 further comprising 
deleting a data storage node for a property to be 
disassociated with the computer procedure. 

8. A linked list of data storage nodes storing 
dataflow analysis properties associated, with a computer 
procedure, comprising 

a plurality of data storage nodes, each data 
storage node including a data storage space for storing 
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an identifier of a property and at least a first pointer 
storage space for storing a pointer identifying a 
location of an other data storage node, 

the data storage space of each node storing an 
identifier of a first property for which the node was 
allocated, and 

the first pointer storage space of' each data 
storage node storing a pointer identifying a location of 
an other data storage node, the other data storage node 
storing a second property associated with the computer 
procedure that is subsequent to the first property in a 
predetermined property order. 

9. The linked list of claim 8 comprising data 
storage nodes of either a first smaller size or a second 
larger size, wherein data storage nodes of the second 
larger size include a second pointer storage space for 
storing a pointer identifying a location of an other 
data storage node. 

10. The linked list of claim 9 wherein the sizes of 
individual nodes in said linked list are randomly 
determined. 

11. The linked list of claim 9 wherein the pointer 
stored into the second pointer storage space of a node 
of the second larger size, identifies a location of a 
subsequent data storage node of the second larger size. 

12. A method of calculating approximations of sets 
of entry and exit properties for each basic block of a 
computer procedure prior to performance of an iterative 
dataflow analysis, each basic block being associated 
with a set of entry and exit properties, as well as 
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property modifications caused by the basic block, 
wherein each iteration of the iterative dataflow 
analysis process does not increase the membership of 
said sets, the method comprising selecting and 
processing each basic block in a predetermined order by 

copying into the set of entry properties of a 
currently selected basic block, the exit set of 
properties from a previously selected and processed 
basic block, 

modifying the set of entry properties of the 
currently selected basic block in accordance with the 
property modifications caused by the currently selected 
basic block, to generate the exit properties for the 
currently selected basic block. 

13. The method of claim 12, wherein said 
predetermined order is a depth-first order. 

14. The method of claim 12, wherein said entry and 
exit properties comprise expressions available upon 
entry and exit from each basic block. 

15. The method of claim 14, wherein said 
modifications caused by a basic block comprise 
expressions generated by the basic block. 

16. The method of claim 15, wherein said 
modifications caused by a basic block further comprise 
expressions killed by the basic block. 

17. The method of claim 14, wherein said 
modifications caused by a basic block comprise 
expressions killed by the basic block. 
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18. The method of claim 12, wherein the entry 
properties of the currently selected basic block are 
copied from the exit properties of a control flow 
predecessor of the currently selected basic block. 

19. The method of claim 12, wherein processing the 
current basic block further comprises removing from the 
entry properties of the currently selected basic block, 
any properties not found in the exit properties of all 
previously selected and processed basic blocks which are 
control flow predecessors of the currently selected 
basic block. 



20. A computer system for compiling a computer 
procedure into a machine-readable representation, the 
computer system comprising: 

(a) an optimizer that optimizes the computer 
procedure into an optimized representation, the 
optimizer storing dataflow analysis properties 
associated with the computer procedure, in a linked 
list of data storage nodes, a data storage node for 
a new property to be associated with the computer 
procedure being generated by allocating a data 
storage node, the data storage node including a 
data storage space for storing an identifier of a 
property and at least a first pointer storage space 
for storing a pointer identifying a location of an 
other data storage node, storing into the data 
storage space of the allocated data storage node, 
an identifier of the new property for which the 
node was allocated, and storing into the first 
pointer storage space of the data storage node, a 
pointer identifying a location of an other data 
storage node, the other data storage node storing 
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an existing property associated with the computer 
procedure that is subsequent to the new property in 
a predetermined property order; and 

(b) a machine-readable code generator that 
generates a machine-readable representation of the 
computer procedure from, the optimized 
representation . 

21. A program product, comprising: 

(a) a program configured to store dataflow analysis 
properties associated with a computer procedure, in a 
linked list of data storage nodes, and generate a data 
storage node for a new property to be associated with 
the computer procedure by 

allocating a data storage node, the data 
storage node including a data storage space for 
storing an identifier of a property and at least a 
first pointer storage space for storing a pointer 
identifying a location of an other data storage 
node, 

storing into the data storage space of the 
allocated data storage node, an identifier of the 
new property for which the node was allocated, and 

storing into the first pointer storage space 
of the data storage node, a pointer identifying a 
location of an other data storage node, the other 
data storage node storing an existing property 
associated with the computer procedure that is 
subsequent to the new property in a predetermined 
property order; and 

(b) a signal bearing media bearing the program. 

22. The program product of claim 21, wherein the 
signal bearing media is a transmission type media. 
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23. The program product of claim 21, wherein the 
signal bearing media is a recordable media. 
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ABSTRACT OF THE DISCLOSURE 

A compiler and method of compiling provide enhanced 
performance by utilizing a skip list data structure to 
store various properties of a program at points of 
interest in the procedure, for example, the properties 
of the statements in each block in the control flow 
graph. A special procedure is used to initialize the 
skip list, prior to performing data flow analysis, to 
ensure that the skip list structure is not used in an 
inefficient manner as a result of initialization. 
Furthermore, special procedures are used to 
simultaneously scan and compare two skip lists as part 
of solving dataflow equations. 
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As a below named inventor, I hereby declare that: 



My residence, post office address and citizenship are as stated beiow 
next to my name. 

I believe I am the original, first and sole inventor (if only one name is 
listed below) or an original, first and joint inventor (if plural names are listed below) 
of the subject matter which is claimed and for which a patent is sought on the 
invention entitled: 

SKIP LIST DATA STORAGE DURING COMPILATION 

the specification of which (check one below): 



( X ) is attached hereto. 



( ) was filed on as Application Serial No. or 

Express Mail No. , Serial No. not yet known, and was 

amended on (if applicable). 

( ) was filed on as PCT International Application 

No. , and as amended under PCT Article 19 on 

(if any). 

I hereby state that I have reviewed and understand the contents of the 
above identified specification, including the claims, as amended by any amendment 
referred to above. 



I acknowledge the duty to disclose to the United States Patent and 
Trademark Office all information known to me to be material to patentability as 
defined in Title 37, Code of Federal Regulations §1.56. 

! hereby claim foreign priority benefits under Title 35, United States 
Code, §119 of any foreign application(s) for patent or inventor's certificate listed 
below and have also identified below any foreign application for patent or inventor's 
certificate having a filing date before that of the application on which priority is 
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I hereby claim the benefit under Title 35, United States Code, §120 
and/or §1 19(e) of any United States application(s) listed below and, insofar as the 
subject matter of each of the claims of this application is not disclosed in the prior 
United States application in the manner provided by the first paragraph of Title 35, 
United States Code, §112, i acknowledge the duty to disclose to the United States 
Patent and Trademark Office all information known to me to be material to 
patentability as defined in Title 37, Code of Federal Regulations §1 .56, which became 
available between the filing date of the prior application and the national or PCT 
international filing date of this application. 
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I hereby appoint John D. Poffenberger (R. No. 20,245), Bruce Tittel 
(R. No. 22,324), Donald F. Frei (R. No. 21,190), David J. Josephic (R. No. 22,849), 
A. Ralph Navaro, Jr. (R. No. 23,050), David S. Stallard (R. No. 25,930), J. Robert 
Chambers (R. No. 25,448), Gregory J. Lunn (R. No. 29,945), Kurt L. Grossman 
(R. No. 29,799), Clement H. Luken, Jr. (R. No. 32,742), Thomas J. Burger 
(R. No. 32,662), Gregory F. Ahrens (R. No. 32,957), Joseph R. Jordan 
*(R. No. 25,686), Wayne L Jacobs (R. No. 35,553), Kurt A. Summe (R. No. 36,023), 
Keith R. Haupt (R. No. 37,638), Kevin G. Rooney {Reg. No. 36,330), C. Richard Eby 
(R. No. 25,854), Theodore R. Remaklus (R. No. 38,754), Thomas W. Humphrey 
(R. No. 34,353), David E. Pritchard (R. No. 38,273), David H. Brinkman (R. No. 
40,532), Donald M. Hill (R. No. 40,646), Stephen W. Bams (R. No. 38,037), 
J. Dwight Poffenberger, Jr. (R. No. 35,324), Scott A. Stinebruner (R. No. 38,323), 
and of counsel, Herbert C. Brinkman (R. No. 16,955) and Richard H. Evans 
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{R. No. 19,755), all of Wood, Herron & Evans, L.L.P., 2700 Carew Tower, Cincinnati, 
Ohio 45202, telephone no. 513-241-2324, and Pryor A. Garnett (R. No. 32,136), 
Steven W. Roth (R. No. 34,712), Roy W. Truelson (R. No. 34,265), Owen J. Gamon 
(R.No. 36,143), Matthew J. Bussan (R. No. 33,614), Karuna Ojanen (R. No. 32,484), 
Christopher A. Hughes {R. No. 26,914), Edward A. Pennington (R. No. 32,588), John 
E. Hoel (R. No. 26,279), and Joseph C. Redmond, Jr. (R. No. 18,753), my attorneys, 
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Wherefore I pray that Letters Patent be granted to me for the invention 
or discovery described and claimed in the foregoing specification and claims, and I 
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power of attorney, and this petition. 
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false statements and the like so made are punishable by fine or imprisonment, or 
both, under Section 1001 of Title 18 of the United States Code and that such willful 
false statements may jeopardize the validity of the application or any patent issued 
thereon. 
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